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Abstract 

We present and investigate a general model for inhomogeneous random digraphs with 
labeled vertices, where the arcs are generated independently, and the probability of inserting 
an arc depends on the labels of its endpoints and its orientation. For this model the critical 
point for the emergence of a giant component is determined via a branching process approach. 
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1 Introduction 

Random directed graphs (digraphs) are widely used for modeling networks arising e.g. in physics, 
biology, social studies and, more recently, bioinformatics, linguistics and the analysis of networks 
in the internet. 

It has been observed that the well studied classical homogeneous random digraph model D(n,p), 
where arcs are inserted independently and with the same probability p, may not fit real life 
networks, because the latter often exhibit statistical properties, such as, e.g., power law in- 
degree/out-degree distribution, that are inconsistent with the model D(n,p). Generally, the 
real life networks are inhomogeneous (see, e.g., Albert and Barabasi [1], Dorogovtsev, Mendes 
and Samukhin [3], Durrett [3j, Newman, Strogatz and Watts [TT]). 

In this paper we study a very general model of sparse inhomogeneous random digraphs with 
independent arcs. By 'sparse' we mean that the number of arcs does not grow faster than linearly 
in n, where n is the number of particles (called vertices of the digraph). By inhomogeneity we 
mean that different arcs are inserted with different probabilities. In particular, our random di- 
graph model is able to produce a wide class of asymptotic in-degree and out-degree distributions 
including power law distributions. 

The main question addressed in the paper is the description of the phase transition in the 
(strongly connected) cluster size, that is, the emergence of a giant strongly connected component. 
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To answer this question we establish a first order asymptotic for the number N± of vertices in 
the largest strongly connected component, by showing N\ = pn + op(n). The fraction p > 
is expressed in terms of survival probabilities of related branching processes that reflect the 
statistical properties of a certain neighbourhood of a randomly chosen vertex. 
Let us mention that mathematically rigorous results on phase transition in D(n,p) were es- 
tablished in Karp [7j and Luczak [8], see also Luczak and Seierstad More general three 
parameter models were studied in Luczak and Cohen [9]. The phase transition in a random 
digraph with given (non-random) in-degree and out-degree sequences was shown in Cooper and 
Frieze Jo) . We also want to mention the related in-depth study of the phase transition phe- 
nomenon in general inhomogeneous random graphs in Bollobas, Janson and Riordan [2\, which 
has inspired our work on inhomogeneous digraphs. 

The paper is organized as follows. In Section 2 we define a finite dimensional model of a random 
inhomogeneous digraph and for this model determine the critical point of the phase transition. 
Section 3 extends finite dimensional results to inhomogeneous digraphs defined by very general 
(possibly infinite dimensional) kernels. Proofs are postponed to Section 4. 

2 The finite dimensional model 

Before presenting our results we briefly recall some relevant facts and notation related to di- 
graphs. 

2.1. A digraph D on the vertex set V = {v±, . . . , v n } is a subset of the set [V] 2 = {(u, v),u, v £ V} 
of all ordered pairs of elements of V. Elements of D are called directed edges or arcs. The fact 
that (u, v) is an element of D is denoted {u — > v} £ D or just u — > v. More generally, if for some 
distinct vertices w%, . . . Wk £ V the collection of arcs V = {(u>i, 1^2), {w2,wz), ■ ■ ■ > (wn,^)} 
is a subset of D then V is called a directed path {d-path) starting at w\ and ending at w^, and 
denoted by w% Wk ■ If v w and w v then the vertices v and w are said to communicate. 
In this case we write v <^~> w. In addition, we define that v v, for every v £ V, even in the 
case where the loop v ~^ v is not present in C A digraph is called strongly connected if every 
pair of its vertices communicates. Since ' <~~»' is an equivalence relation, it splits the vertex set V 
into a union of disjoint subsets of elements communicating with each other. The subgraph of D 
induced by such a subset of vertices is called a strongly connected component (SC- component). 
We are interested in the fast growth of the largest SC-component when the density of random arcs 
gradually increases in the range 0(n -1 ). Here the important parameter is the size iVj. = Ni(D) 
of the largest SC-component (the number of vertices of the SC-component, which has the largest 
number of vertices). Another interesting parameter is the size N2 = A^-D) of the second largest 
SC-component. 

2.2. We assume that vertices belong to different types and that the probability of an arc depends 
on the types of its endpoints and the scale parameter n only. In addition, we assume throughout 
this section that the set of different types is finite and as the number of vertices increases. 
A similar model of random graphs (but not digraphs) has been introduced by Soderberg |12j . 
Let us introduce some more notation. Let S = {si, s%, . . . , Sk} denote the set of types, and 
let s(v) denote the type of a vertex v £ V . We write n = n\ + n2 + • • • + n^, where each 
rii = #{v £ V : s(v) = Si} denotes the number of vertices of type £ S. 

Given a integer vector n = (m, . . . , n^) and a k x k matrix P = \ \pij\\ with non-negative entries, 
define the inhomogeneous random digraph T> on the vertex set V as follows. The set of arcs 
of T> is drawn at random from [V] 2 so that events {u — > v} £ T> are independent and have 
probabilities P(u — ► v) = 1 A (pijn^ 1 ), for each (u,v) £ [V] 2 . Here i and j refer to the types 
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S{ = s(u) and Sj = s(v) of the endpoints u and v. We use the notation a A b and a V b for 
min{a, b} and max{a, 6} respectively. In order to stress the dependence of the model on the 
parameters P and n we sometimes write T> = Dp^. 

We shall assume that the fraction of vertices of a given type is asymptotically constant. That 
is, there is a probability distribution Q on the type space S such that for each qi = Q(s{) we 
have 

qi > and rn — qin = o(n) as n — > oo. (1) 

Clearly, for large n, the typical/statistical characteristics of the digraph depend solely on 
the distribution Q and matrix P = ||py||. In order to describe the phase transition in Ppn in 
terms of Q and P we use the "language" of branching processes. 

Let us consider multi-type Galton- Watson processes where particles are of types from S. Given 
s & S, let X(s) (respectively y(s)) denote the Galton- Watson process starting at a particle of 
type s such that the number of children of type Sj G S of a particle of type Sj G S has Poisson 
distribution with mean Pijqj (respectively Pjiqj), 1 < i, j < k. We write X = {X(s), s € 5} 
and y = {y(s), s G S}. Let px(s) and py(s) denote the non-extinction probability of X(s) and 
y(s) respectively. Write 

P = Pxy = Px{si)py{si)qi. (2) 

l<i<fc 

We show in Theorem 1 below that a giant SC-component of range n emerges in T> whenever p 
is positive. 

Theorem 1. Assume that (CP holds. Assume that the matrix P 
n — » oo we have 

Ni(2>p,n) = pxy n + op(n) 

and N2 = op(n). 

Here for a sequence of random variables {Z n } we write Z n = op[n) if lim n P(|Z n | > Sn) = for 
each 5 > 0. 

Recall that the matrix is called irreducible if its associated digraph Dp is strongly con- 

nected. Here Dp is the digraph on the vertex set S such that {sj — > Sj} € -Dp whenever 

Py > 0. 

Remark. In the case where Dp is not strongly connected, Theorem 1 can be applied to its 
strongly connected components. Let D\,. . . , D r denote the strongly connected components 
of Dp. Given 1 < m < r, let S m C S denote the vertex set of D m and let V m denote the 
subgraph of the random digraph V induced by the vertex set V m = {v E V : s(v) G S m }. 
Note that, for i ^ j, any two vertices and u G Vj do not communicate in T>. Therefore, 

each strongly connected component of T> is a subgraph of some T> m . The asymptotic size of the 
largest strongly connected component of V m is, by Theorem 1, Ni(V m ) = p m n + op(n). Here we 
denote p m = Y2ses m Px( s )Py( s )Q( s )- ^ follows now that N\(T>) = maxi< m < r p m n + op(n). In 
this way we obtain an extension of (J3j) to general (not necessarily strongly connected) digraphs 
Dp. 

3 The general model 

The result of Theorem [1] extends to a much more general situation where the type space S 
is a separable metric space, Q is a probability measure defined on Borel sets of S. Here the 
matrix \ \pij\ \ is replaced by a non-negative S x S measurable kernel n(s,t), s,t £ S. 



\\pij\\ is irreducible. As 
(3) 
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Let xi,X2,... be a sequence of random variables with values in S such that the empirical 
distribution of the first n observations xi,...,x n approximates the measure Q in probabi- 
lity as n — » oo. That is, we assume that for each Q-continuous Borel set A C S we have 
#{i E [l> n ] : x i E ^4}n _1 = + op(l) as n — > oo. Recall that a Borel set ^4 is called 

Q-continuous whenever its boundary dA has zero probability Q(dA) = 0. 

Given n, let T> n be the random digraph on the vertex set {x}™ = {x\, . . . , x n } with independent 
arcs having probabilities P({xj — > x.,} E £> n ) = 1 A (n re(xj,Xj)), 1 < i,j < n. Combining S, Q 
and k we obtain a very large class of inhomogeneous digraphs with independent arcs. Obviously, 
the model will include digraphs with in-degree and out-degree distributions which have power 
laws. 

Such a general model, for random graphs (not digraphs), was introduced in Bollobas, Janson 
and Riordan [2]. Note that in the case of random graphs it is necessary to assume, in addition, 
that the kernel k is symmetric. In the definition of digraphs T> n , n > 2, we do not require the 
symmetry of the kernel. 

For large n, the phase transition in the digraph T> n can be described in terms of the survival 
probabilities of the related multi-type Galton-Watson branching processes with type space S. 
Given s E S, let X{s) (respectively y(s)) denote the Galton- Watson process starting at a particle 
of type s such that the number of children of types in a subset A C S of a particle of type t E S has 
Poisson distribution with mean f A n(t,u)Q(du) (respectively J.K(u,t)Q(du)). These numbers 
are independent for disjoint subsets A and for different particles. The critical point of the 
emergence of the giant SC-component is determined by the averaged joint survival probability 

Pxy = / px{s)py{s)Q{ds) (4) 
Js 

being positive. Here px{s) and py{s) denote the non-extinction probabilities of X(s) and y(s) 
respectively. In particular, for the general model of an inhomogeneous digraph, (|3|) reads as 
follows 

N\{V n ) = pxy n + op(n) as n — > oo. (5) 

In order to establish ([S]) we need to impose further conditions on the kernel k, like those in 
[2]. Namely, we need to assume that the kernel k is irreducible Q x Q almost everywhere (see 
Remark after Theorem [1]). That is, for any measurable A C S with Q(A) / 1 or 0, the identity 
Q x Q({(s,t) E Ax(S\A) : K(s,t) + 0}) = implies Q(A) = or Q(S\A) = 0, see [2]. In 
addition we assume that k is continuous almost everywhere on (S x S,Q x Q), and the number 
of arcs in D n , denoted by \T> n \, satisfies 

n _1 E|I? n | -> / / k(s, t)Q (ds)Q (dt) < oo 

J JSxS 

as n — > oo. Note that here we implicitly assume that k is integrable, i.e., k E Li(S x S,Q x Q). 
We will not give the proof of ([5]). It can be obtained from Theorem [1] via a finite dimensional 
approximation argument similar to that of the proof of Theorem 3.1 in [2]. 

4 Proof 

In the proof we shall use ideas and techniques developed in Karp [7] and Bollobas, Janson and 
Riordan [2]. 
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Proof of Theorem{l\ Given a vertex v G V, let X(v) denote the set of vertices that can be 
reached from v via d-paths, and let Y(v) denote the set of starting points of d-paths ending at 

v, 

X(v) = {u G V : v ~^ u}, Y(v) = {u G V : u v}. 

Given a function w(n) such that uj(n) — > oo and o;(n) = o(n) as n — > oo, we say that v € V 
is x—big (respectively y—big) if > w(n) (respectively, |V(u)| > oj(v)). The set of x— big 

(respectively y—big) vertices is denoted B x = B x (uj) (respectively B y = B y (to)). We write 
B = B{uj) = B x (u) n B y {uj) for the set of vertices which are x—big and y—big simultaneously. 
We show that for any such function lo, 

n _1 |i?(u;)[ — pxy = op(l) as n — > oo. (6) 

For this purpose it suffices to show that for each u we have 

rr 1 E\B(fj)\ = p X y + o(l) (7) 

and to establish ([6]) for at least one such function, say u>o(n) = Inn. 

Indeed, assume that ([7j) holds. Let u>,u>' be two such functions and B, B' denote the correspond- 
ing sets of large vertices. For the size of the symmetric difference B A B' = (BU B') B') 
equals \B(lo A u')\ - \B(oj V u')\ > 0, we obtain from (0) that E\B A B'\ = o(n). It follows that 
\\B\ — \B'\\ < \B A S'| = op(n). In particular, Q holds for every cj whenever it is satisfied by 
at least one such function uj. 

Proof of ([?p. Forward exploration. Given v G V, we explore the set -X'(u) as follows. Color all 
vertices blue. Color v white and put it in the list, which now contains a single white vertex v. 
Then proceed recursively: choose a white vertex from the list, color it black, reveal all outgoing 
arcs emerging from this vertex to blue vertices, color these vertices white and add them to the 
list. Stop when we have collected at least u>(n) vertices in the list (hence v G B x (uj)), or there 
are no white vertices left in the list ( hence we have explored entire set X(y) and v £ B x (u)). 
Let X w {v) denote the set of vertices collected in the list. Given u,w S X(v) we say that u is 
an f-child (forward child) of w if {w — > u} G T> and u was blue when w discovered it during the 
exploration process. Since the last black vertex adds to the list all its blue neighbours (endpoints 
of outgoing arcs), the size 1X^(^)1 can not exceed u(n) by more than the out-degree of the 
last black vertex. 

Backward exploration. We perform the same exploration process starting at v as above, but in the 
transposed digraph D*, which is obtained from T> by reversing direction of arcs {{v — > u} G T> <^> 
{u — > v } G T>*). That is, now the search for neighbours propagates in the reverse direction of 
the arcs of T>. Let Yu(v) be the subset of Y(v) obtained in at most u{n) steps of the exploration. 
Given u,w G Y w (v), the vertex u is called a b— child (backward child) of w if {u — > w} G T> and 
u was blue when w discovered it. Again, we have < ui(n) + A y , where now A y is the 

out-degree in T>* (in-degree in D) of the vertex last explored. 

By a coupling of an exploration process with the approximating Galton- Watson process, Bollo- 
bas, Janson and Riordan [2] showed that the fraction of large vertices (the number \B(u)\/n) 
converges in probability to the survival probability of the Galton- Watson process, see Lemma 
9.6 in [2]. Their results are stated for (undirected) random graphs, but several steps of their 
proof extend to random digraphs as well. In particular, by a coupling of the forward exploration 
process with X (backward exploration process with y) we obtain as n — > oo uniformly in v £ V 

P(i? G B x (u>)) = px{s(v)) + o(l), P(veB v (u)) = py(a(v)) + o(l), (8) 
P(A X > Inn) = 0(rr x ), P{A y > Inn) = 0{n~ l ). (9) 
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Let us show (JJJ) for u>(n) satisfying oj(n) < Inn. Introduce the events A x {v) = > u>(n)} 

and A y {v) = > u(n)}. Since \X w (v)\ > w{n) v G ^(cj) and > oj(n) 

v G B y (uj) we have P(u G B(uj)) = P(A x (v) fl .4^ (?;)). In view of ([9|) with a high probability 
each of the events A x (v) and A y (v) refer to at most ui{n) + lnn < 2 Inn vertices. Therefore, we 
may expect that for large n these events are almost independent and we have (see (|SJ)) 

P( v e b(u)) = P(A x (v) n A y {v)) = Py {s{v)) Px {s{v)) + 0(1). (10) 

We show that ()10p holds uniformly in v G V. Then ([7J) follows from (pQ), ([2]) and ()10p via the 
identities 

E|B(w)| =E^I {V6B(U))} = ^P(t; G B( W )). (11) 

Let us prove (fTUj). Let *4 = {X<j (t>) fl i^, (v) = t>} denote the event that two exploration processes 
after starting at v do not meet each other in the first uj(n) steps of the exploration. Observe, 
that uniformly in v G V 

P(A) = 1 - o(l) as n -> oo. (12) 

Indeed, assume that the set ^(ti) is already constructed and now we construct the set Y^{v). 
Note that conditionally, given X w (v) being of size at most uj{n) + Inn, each black vertex 
of Y w {v) discovers at least one b-child in X w (v) with probability at most |X a; (u)|p^n _1 < 
(u(n + lnn)p*n _1 , where p* = max^<j j<kPij- Since there are at most uj(ti) black vertices 
in Y w (v), we conclude that on the event V x = {\X u (v)\ < uj(ti) + Inn} the conditional probabi- 
lity _ 

P(A | X w {v)) < w(n)(w(n) + \an)p*n~ x . (13) 

Here A denotes the complementary event to A. It follows from (|13p that P(A fl T> x ) = o(l). 
The latter bound combined with Q shows (jl2j) . 

Now we are ready to show (|10p . Assume again that X w {v) has already been constructed. Now 
we have to construct the set Y u (v). The vertices of V = V \ X^iv) remain blue. In particular 
for every i the set V contains at least m — (u(n) + Inn) = n«(l — o(l)) blue vertices of type 
Sj. Conditionally on the event A fl P^, the exploration of l^(w) (until we stop it after at most 
u(n) steps) stays within the set V' of size n(l — o(l)). The second identity of (jHJ) applies to the 
conditional probability P[A y (v) | Xu(v),A) and yields 

P(A»|X,»,.4) =p3,( s ( w ))+ (l) 

uniformly in X w (v), satisfying the event T> x . Therefore, we have 

P(A y (v)nA x (v)nAnv x ) = P(A y (v)\A x (v)nAnv x )P(A x (v)nAnv x ) 

= p y (s{v))P(A x (v)nAnv x ) + o{i). 

In view of @, $T2\) we can replace P(Ay{v) n A x (v) Pi A Ci V x ) by P(A y (v) n Ar(«)) and 
P(A x (v)nAnV x ) by Ppt^))- Therefore, we obtain P(A x (v)nA y (v)) = p y (s(v))P(A x (v)) + 
o(l). Finally, invoking ([5]) we obtain(|10|). Thus we have proved ([7]) for oj{n) satisfying the extra 
condition u(n) < Inn. 

Let us now prove ([7]) for arbitrary u (satisfying u>(n) — » oo and u;(n) = o(n) as n — > oo). Fix 
such an to. We apply (JJJ) to u/(n) = uj(n) A ln(n) and invoking the inequality j-B(w)| < |.B(a/)|, 
we obtain the upper bound E|S(o;)| < npxy + o(n). The corresponding lower bound 

E\B(uj)\>np X y + o(n). (14) 
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follows from (|lip and the inequalities, which hold uniformly in v € V, 

P(« G B(u)) > px(s(v))py(s(v)) - (15) 

To show that (|15p holds, we first perform a forward exploration starting at v and obtain the 
set X w (v). Afterwards, in the digraph induced by the vertex set V° := (V \ U {v} we 

perform a backward exploration starting at v. The set of the thus discovered vertices is denoted 
by Y®(v). For a set V° containing at least ni — (u>(n) + Inn) = nj(l — o(l)) vertices of every 
type Sj G <S, the approximation of the distribution of the backward exploration process by the 
distribution of the Galton- Watson process y(v) remains valid. That is, the second identity of 
(J5J applies to the conditional probability P(|l^ ) (v)| > uj(n) | X^ivjj. We have 

P(|Y»| > u{n) | XUv)) = py(s(v)) + o(l), (16) 

uniformly v and in X w (v) satisfying < u;(n)+lnn. Since Y(y) contains Y®(v) as a subset, 

we obtain 

P{\Y(v)\ > u(n) | X u {v)) > py(s(v)) + o(l). 

The latter inequality in combination with the first identity of ([8|) and the first bound of ([9|) 
shows ()15p . We now have arrived at (|14|) , thus completing the proof of © . 
Proof of We prove (0) for a particular function Uo(n) = Inn. Note that follows from ([7J) 
and the bound 

Var|S(wo) | = o(n 2 ) (17) 
via Chebyshev's inequality. In addition, (|17p follows from the identities 

V a r\B(u )\ = E\B(lu )\ 2 - (Ej.B^o)!) 2 , 
E|B( Wo )|(|5(o;o)|-l) = 2E E l {veB M }h 

{u,v}cV 

combined with the identity, which holds uniformly in {u, v} C V, 

EI{„ 6 B( a , )}I{ u eB( £l , )} = px(s(v))py(s(v))px(s(u))py(s(u)) +o(l). 

The latter asymptotic identity is shown in much the same way as (jlOp above. 

^roo/ o/ (0). Write JVi = Ni(Vr,n)- Note that for every u(n) we have iVi < u(n) V |-B(o;)|. In 
combination with Q this inequality implies the upper bound 

Nm- 1 <p X y + op{\). (18) 

Here and below for a sequence of random variables {Z n } we write Z n < op(l) (respectively 
Z n > op(l)) if for every <5 > we have lim n P(Z n < 5) = 1 (respectively lim„ P(Z n > —5) = 1). 

For pxy = the result ([3]) follows from (fTBj) . For pxy > the result ([3]) follows from (fT8|) and 
the lower bound 

Nm' 1 > pxy + o P (l). (19) 

In order to show this lower bound we generate the digraph D in two steps. Firstly we generate 
a digraph V = V^ p / and then, on the top of it, we generate another digraph V" = V^i/ ^ 

independently of T>' . Here the numbers p'^ and p'-j are defined by the equations 

p'ij = Py(l - e), (1 - Pijn^il - p^n' 1 ) = 1 - pyn" 1 , 1 < i, j < k. (20) 
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Here < e < 1 is fixed and we assume that n is so large that all pijvT 1 < 1. The union T>' U T>" 
is the digraph on the vertex set V such that, for each (u, v) € [V] 2 , we have {v — » u} € V U 2?" 
whenever — ► -u} is present in at least one of the digraphs V and T>" . Note that, by the second 
equation of (|20p . the random digraphs T>' U T>" and £> have the same probability distribution. 
Let X' = {X'(s), s £ 5} and y' = {y'(s), s € S} be the multi-type Galton-Watson processes 
with Poisson offspring distributions that approximate the forward and backward explorations 
of neighbourhoods of vertices in V. They are defined in the same way as X and y above, but 
with respect to the matrix Let = px>y> be defined by (|2|). One can show (e.g., by 

coupling of X(s) with X'(s) and y{s) with y'(s)) that p' x {s) — > px{s) and p'y{s) — ► py(s) as 
e I 0. In particular, we have 

limp[ E] = p^- (21) 



We are now ready to prove (|19|) . Fix the function uj\(n) = nj Inn. Given v (zV, let X' (v) and 
Y^iy) denote the neighbourhoods of v discovered by the forward and backward explorations 
performed in T>' . Let B'{uj\) denote the set of large vertices of V. From §§§ we obtain 



n 



- 1 | j BV)|-P[ £ ]=op(1)- (22) 



We shall show that with high probability every pair of vertices from B'{uji) communicate in T>. 
This will imply that iVi is at least as large as |i?'(u;i)|, and, as a consequence, we then obtain 
the lower bound (f26|) . 

We generate the digraph T>" in k steps so that T>" = PiU - • - WDk. Here Z>j, 1 < i < k are indepen- 
dent copies of 2?|| p * where the matrix \\p*j\\ is defined by the equations 

(1 -p* n- 1 ) fc = 1 -p-'-n" 1 , 1 < i,j < k. Note that for {si Sj} G D P identities (^0]) 



imply 

P*j > k~ 1 sp ij > po, po := k^eminipij : pij > 0} > 0. (23) 

For i = 1, . . . , k denote X i (v) = {u £ V : {v ~* u} G V U V x U • • • U P*} and write V(si) = 
{v £ V : s(v) = Si}. We shall show that with high probability every set X k ~ 1 (v), v £ B'(uii), 
contains at least 0(n(lnn) _1 ) vertices of each type. More precisely, the event 

n = n veB , {uJ1 ) ni<i< fc {\x k ^(v) n V( Si )\ > n(\nn)- l >c}, 

where x := k~ 1 (po/4) k ~ 1 qi X • • • X q^, has probability 

P(H)=l-o(l) as n^oo. (24) 

Observe, that the event TL depends on the random graph T>* = T>' U 2?i • • • U Dfc-i and is 
independent of T>k- We show that on the event TC the conditional probability, which now refers 
to Dfc, satisfies 

p(Ni > TT*) = l-o(l). (25) 



This bound in combination with ()24p and (|22p implies the lower bound 

Nm- 1 >p [£] + o P {\). (26) 

Letting e J, from (f2T|) we obtain (fT9|) . We complete the proof of ([3]) by first showing ([25]) and 
then 



Proof of i25\) . Given T>*, define the events 

A uv = {3 xeX k ~ l (u), 3 jey^i)) such that {x^y}eV k }, u,v £ B'(ui). 
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Let A uv denote the event complement to A uv . Introduce the sum 

s= £ 

{u,v}cB'(u>i) 

which (given T>*) is at least as large as the number of pairs {u, v} G B'{uji) that do not com- 
municate in D. We claim that on the event Ti., we have S = with high (conditional given T>*) 
probability. Indeed, the largest of the sets Y'(v) fl V(s{), 1 < i < k is of size at least uji{n)/k. 
Assume it is the r-th set Y r := Y^ x (v) fl V(s r ). Since Dp is strongly connected, {sj — ► s r } £ Dp 
for some Sj. Given the event H, the set Ay := X k ~ l (v) n ^(sj) is of size at least 0(n(lnn) -1 ). 
Therefore, we have 

P(Auv I'D*) < (1 -p* r n- l )\ x i\*\ Y r\ < {l-pon- 1 )^*^ < dn~\ (27) 

Here c' > denotes a constant depending only on | \pij \ | and e. It follows from (|27p by Cheby- 
shev's inequality that given the event Ti we have 

1 - P(5 = 0|P*) = P(5 > 1|2>*) < E(5|D*) < c'n~ 2 . 

Since A' - ! > — S, the latter bound implies ([25]) . 

Proof of {21$ . Let P'(-) = P(-|2?') denote the conditional probability given T>' . In order to prove 
(|24p . we show that, for each v € B'(uji) and 1 < i < k, 

P'dX*" 1 ^) fl < n(lnn) _1 x) = 0(n- 4 ). (28) 

Fix u € B'{uj\) and i. The largest of sets X^ (u) fl V(sj), 1 < j < k is of size at least u>i(n)/k. 
Assume it is the r-th set X° := X' ljJ1 (v)PiV(s r ). Since L>p is strongly connected we find a shortest 
path 

s r = to ~> ti — > ■ " " ~~ > *j = s j (29) 
in Dp. Denote x m = A; _1 (p /4) m Q(ii) • ■ ■ Q(t m ). We claim that, for 1 < m < j, 

P'(|X"» n V(t m )\ < n{\nn)- l x m ) = 0(n- A ). (30) 

Note that, in view of the inclusions X'(v) = X°(v) C X 1 (u) C ••• C X k ^ 1 (v) and the 

inequalities k < tq, the bound (f30|) (for m = j) implies ([28]) . We show ([30|) . for m = 1, 2, 

Let m = 1. Let A 1 denote the set of endpoints in V(t%) of arcs in T>\ that start at vertices in 
A . We have 

\X\v)nV(t 1 )\>\X x \= Hue*} 

The right-hand sum has binomial distribution Bi(\V(ti)\,pi) with |V(fi)| = nQ(t{) + o(n) trials 
and success probability 

Pi > 1- (l-pon -1 )'* ! > 1 - (l-p ri -1 ) a,l(n)/ * : . 

Indeed, po n_1 is the smallest probability of arcs in £>i, and there are at least |A°[ > ui(n)/k 
vertices in X°(v) that "try" to send an arc to a given vertex w € ^(^i)- A simple analysis 
show that p\ = 0(/n _1 (n)). In particular, for large n we have p\ > po/ '(2klnn). Therefore, 
we obtain EJA 1 ) > 2xin(lnn) _1 (l + o(l)). Now, an application of Chernoff bound (see, e.g., 
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Janson, Luczak and Rucinski [6j) to the binomial random variable shows (|30p for m = 1. 
Next, given the event {{X 1 ^) n > n(ln n) which is independent of T>2, we show 



P[\X 2 (v) nV(t 2 )\ < n(lnn) _1 x 2 P',PiJ = 0(n~ 4 ). 

in much the same way. That is, we show that number of different endpoints in Vfo) of arcs 
in T>2 that start in X^{v) n V(t\) is strongly concentrated around 2x2n(hm)~ 1 . In this way, 
proceeding along the path (|29p until the last endpoint Sj and using arcs from T>i, T>2 etc. for 
the successive steps respectively, we arrive at (f30|) . 

In the very last step of the proof we show N2 = op(n). Indeed, this bound follows immediately 
from O, (H2|) via the simple inequality N x + N 2 < 2u(n) + \B(u)\. □ 

Acknowledgement. Mindaugas Bloznelis would like to thank Colin Cooper for valuable and 
inspiring discussion. 
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